Abstract



In this paper, a compress-and-forward scheme with backward decoding is presented
for the unicast wireless relay network. The encoding at the source and relay is a
generalization of the noisy network coding (NNC) scheme [1]. While it achieves the same
reliable data rate as the noisy network coding scheme, the backward decoding allows for
a lower decoding complexity than the joint decoding of the NNC scheme.
Characterizing the layered decoding scheme is shown to be equivalent to characterizing
an information flow for the wireless network. A node-flow for a graph with bisubmodular
capacity constraints is presented and a max-flow min-cut theorem is established. This
generalizes many well-known results of flows over capacity constrained graphs studied
in the computer science literature. The results for the unicast relay network are generalized
to the network with multiple sources with independent messages intended for a single
destination.



1 Introduction 

The primary focus of this paper is a unicast wireless relay network: a single source node
intends to communicate reliably with a single destination node with the assistance of many
relay nodes. The communication channels are wireless: the signal transmitted by a node is
broadcast to all other nodes, and the signal received at a node is a linear superposition of the
transmitted signals plus a random additive noise, which has the familiar Gaussian distribution.

In [2] a quantize-map-forward scheme was presented for the wireless relay network. It was 
shown that this scheme is approximately optimal, i.e. it gives a reliability criterion for rates 
within a constant gap of the cutset bound, where the constant gap depends only on the size 
of the network and not on the channel parameters. In this scheme, each node quantizes the 
received signal, symbol by symbol, at the noise level. The quantized symbols accumulated 
together in a block are then mapped to a transmit codeword at that node. These transmission 
codebooks at every node are generated independently of each other. 
In [3], a related scheme was presented for the wireless relay network. Here, the coding and
quantization are done in a structured manner using lattices. The scheme was shown to achieve
performance similar to the quantize-map-forward scheme of [2] in terms of the reliable rates.

In [1], a noisy network coding scheme in the more general setting of the discrete memo- 
ryless network was presented for the unicast relay network and also generalized to the case 
of multicast and multiple sources with single destination. In this scheme, the relay quantizes 
the received signal in blocks using vector-quantization, subsequently mapping each quan- 
tized codeword to a unique codeword, which is re-transmitted by the relay. Specialized to 
the wireless network, the noisy network coding can be thought of as a vector version of the 
quantize-map-forward scheme, where each relay does a vector quantization rather than the 
scalar quantization proposed in [2]. 

In [1], an alternate approach was provided, wherein the discrete superposition network 
was used as a digital interface for the wireless network and the scheme was constructed by 
lifting the scheme for the discrete superposition network. The discrete superposition network 
provided the quantization interface for this scheme. 

In this paper, a compress-and-forward scheme is presented for a relay network in the
general setting of the discrete memoryless network. The encoding is similar to the noisy
network coding scheme, but the relay mapping is generalized, so that the relay node compresses
the received signal in blocks, on top of the vector quantization in NNC. The additional compression
does not increase the achievable rate beyond the rate achievable by NNC; however, the first
main result of this paper is that, if the compression rates are chosen appropriately, then a
lower-complexity backward decoding achieves approximately the same rate. This result was
also proved independently in [5]. The second main result of this paper is that
this appropriate choice of compression rates can be computed efficiently by computing a
node-flow on a bisubmodular capacitated graph. The flow formulation captures the rate of actual
information that should flow through each node to support a given rate of flow of information
from the source to the destination. In other words, this paper shows that backward decoding
does almost as well as joint decoding, if the relay nodes compress their signals to capture the
right amount of information that should flow through each of them given the network topology.

The paper presents a max-flow min-cut result for a node-flow on a bisubmodular
capacitated graph. This is related to many well-known results of flows over capacity constrained
graphs studied in the computer science literature, albeit with two differences: the first is
that the flow is defined over nodes rather than, as is conventional, over
edges; the second is that the graphs are restricted to layered graphs. The first
difference is fundamental. Flows over graphs are conventionally defined as numbers
over edges of the graph, such that for every node the incoming flow is equal to the outgoing
flow. Since the motivation here is to model the wireless network, where there are no physical
edges, it is more appropriate to define a node-flow rather than an edge-flow; the relation being
that the node-flow represents the incoming flow or outgoing flow at the node. The second
difference is less fundamental: the restriction to layered graphs is made only because the
block-coding scheme for the relay network can be studied by considering a virtual layered
network, and the layering offers a convenient way of defining the bisubmodular capacity
functions on the graph.
[Figure 1 panel labels: Models; Random Coding Schemes; Structured Coding Schemes.]

Figure 1: A depiction of the communication schemes on the Gaussian and linear deterministic
networks. The main result of this paper is represented by the upper-right bubble in red.

The bisubmodular capacitated graph presented here is motivated by the ideas of linking
systems and flows introduced in [6], [7], [8], [9] in the context of the linear deterministic network.
The linear deterministic network was introduced in [2] as a model that captures many features
of the wireless network. A random coding argument was used to show the existence of schemes
that achieve the capacity of the linear deterministic network [1], [2]. On the other hand, [6], [7]
developed a polynomial time algorithm that discovers the relay encoding strategy using a
notion of linear independence between channels. Taking this concept forward, in [8], [9], the
concept of flow was introduced for the linear deterministic network. The flow value at each
node in this network corresponds to the number of independent equations that the particular
node needs to forward. The result in this paper can be viewed as a loose analog of these
results in the context of the Gaussian network; see Figure 1. The additional structure of
the linear deterministic channel is used in [6], [7], [8], [9] to show that a single-block coding
scheme, where a simple permutation matrix at each node maps the received vector to the
transmit vector, is optimal. Both the flow values at the nodes and the permutation mapping
were constructed in polynomial time.

The rest of the paper is organized as follows. In Section 2, the compress-and-forward scheme
for the relay network is described and characterized. A lower-complexity layered decoding is
presented and the achievable rates are characterized. It is shown that this decoding scheme
does as well as the joint decoding scheme. To prove this result, the notion of node-flows for
a bisubmodular capacitated graph is developed in Section 3. In Section 4, the results are
generalized to the network with multiple sources with independent messages intended for a
single destination. In Section 5, we discuss the ramifications of our algebraic flow formulation
for the important special cases of the Gaussian wireless relay network and the deterministic
relay network.
2 Unicast Relay Network



A communication network is represented by a set of nodes $V$. Each node in the network
abstracts a radio, which can both transmit and receive (in full or half duplex modes). The
traffic is unicast: a single source node communicates reliably with a single destination node
using the other nodes in the network as relays. We will be interested in a single-source
single-destination relay network, which has a unique source node $s$ and destination node $d$; the
other nodes function as relay nodes. At any node $v$, the transmit alphabet is given by $\mathcal{X}_v$
and the receive alphabet by $\mathcal{Y}_v$ (supposed to be discrete sets, for the most part). Time is
discrete and synchronized among all nodes. The transmit symbol at any time at a node $v$
is given by $x_v \in \mathcal{X}_v$ and the receive symbol is given by $y_v \in \mathcal{Y}_v$. A memoryless network will
be considered here, wherein the received symbol at any node at any given time depends (in a
random fashion) only on the current transmitted symbols at other nodes.

A $(2^{RT}, T)$ coding scheme for the relay network, which communicates over $T$ time instants,
comprises the following.

1. The message $W$, which is modeled as an independent random variable distributed
uniformly on $[2^{RT}]$. $W$ is known at the source node and is intended for the destination
node.

2. The source mapping for each time $t \in [T]$,

$$f_{s,t} : \mathcal{W} \times \mathcal{Y}_s^{t-1} \to \mathcal{X}_s. \tag{1}$$

3. The relay mappings for each $v \in V \setminus \{s\}$ and $t \in [T]$,

$$f_{v,t} : \mathcal{Y}_v^{t-1} \to \mathcal{X}_v. \tag{2}$$

4. The decoding map at destination $d$,

$$g_d : \mathcal{Y}_d^{T} \to \mathcal{W}. \tag{3}$$

The probability of error for destination $d$ under this coding scheme is given by

$$P_e = \Pr\{\hat{W} \neq W\}. \tag{4}$$

A rate $R$ (in bits per unit time) is said to be achievable if for any $\epsilon > 0$, there exists a $(2^{RT}, T)$
scheme that achieves a probability of error less than $\epsilon$, i.e., $P_e < \epsilon$. The capacity
of the network is the supremum of all achievable rates.

It was shown in [2] that any arbitrary communication network can be converted into a 
layered network by coding over blocks of time. Each layer then captures the operations in 
the corresponding block of time. Further, if the nodes have half-duplex constraint, then 
this time-layering is done with a fixed transmit-receive schedule, which says which nodes are 
transmitting and which ones are listening in any block of time. It is then a secondary question 
to optimize over the schedule in order to get the maximum rate of transmission. 
Figure 2: A layered network.

Henceforth, the focus will be only on an $L$-layered network as shown in Figure 2, so that

$$V = \bigcup_{l=1}^{L} O_l, \tag{5}$$

where $O_l$ denotes the set of $m_l$ nodes in the $l$-th layer. The $k$-th node in the $l$-th layer will be
denoted by the ordered pair $(l, k)$. The first layer has only one node, which is the source node
and is denoted by $(1, 1)$ or $s$. The last layer has only the destination node, which is denoted by
$(L, 1)$ or $d$. The nodes other than the source and the destination node will be referred to as
the relay nodes and are denoted by $V_r$, i.e.,

$$V_r = \bigcup_{l=2}^{L-1} O_l. \tag{6}$$

In the layered network, the received symbol for a node in the $(l+1)$-th layer depends only on
the transmit symbols from the nodes in the $l$-th layer. Therefore, for the layered network the
channel, which is denoted by a transition probability function, can be simplified into a product
across layers as follows:

$$p(y_V | x_V) = \prod_{l=1}^{L-1} p(y_{O_{l+1}} | x_{O_l}). \tag{7}$$

The noise across each relay node is assumed to be independent, which implies that the channel
function for each layer is further given by

$$p(y_{O_{l+1}} | x_{O_l}) = \prod_{k=1}^{m_{l+1}} p(y_{(l+1,k)} | x_{O_l}). \tag{8}$$

Here $x_{O_l}$ is used to denote $\{x_v : v \in O_l\}$; $y_{O_{l+1}}$ is similarly defined. This models the
communication channel for the layered network.
In particular, if the received symbol is a deterministic function of the transmitted symbols,
i.e.,

$$y_{O_{l+1}} = g_l(x_{O_l}), \tag{9}$$

then the network is called a deterministic network. Further, if the transmit and received
symbols are restricted to vectors over finite fields and the deterministic function is modeled
as a linear function, such that

$$y_{O_{l+1}} = G_l\, x_{O_l}, \tag{10}$$

then the network is called a linear deterministic network. If the network is a wireless network,
then the alphabet sets are complex and the probability transition function is linear with an
additive complex Gaussian noise $z_v$, such that

$$y_v = \sum_{u \in O_l} h_{v,u} x_u + z_v, \tag{11}$$

where $v \in O_{l+1}$. The wireless network is the one of most practical interest, and in
[2] it was shown that the linear deterministic network captures many features of the wireless
network.


2.1 Compress-and-Forward Scheme 

In this section, the compress-and-forward scheme is described and its performance is
characterized. It is a block-encoded scheme where each node performs its operation over blocks of
time symbols. The relay node quantizes (or compresses) the symbols it receives over a block
of time to a finite number of bits. These bits are then transmitted in the next block. The compression
rate at a relay node is defined to be the rate of transmission of the compressed bits.

Assuming that uniformly sized blocks of $T$ symbols are used by each node for this
operation, a compress-and-forward scheme is parametrized by $(T, R, \{r_v\}_{v \in V_r})$, where $R$ is the
overall rate of communication and the $r_v$'s are the compression rates at the relay nodes. A rate
vector $(R, \{r_v\}_{v \in V_r})$ is said to be feasible w.r.t. the compress-and-forward scheme if, for any
arbitrary $\epsilon > 0$, there exists a compress-and-forward scheme $(T, R, \{r_v\}_{v \in V_r})$ which achieves
a probability of error less than $\epsilon$.

The following theorem characterizes the feasible region of $(R, \{r_v\}_{v \in V_r})$ for the
compress-and-forward scheme.

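Theorem 1. A rate vector $(R, \{r_v\}_{v \in V_r})$ is feasible if for some collection of random variables
$\{X_V, \hat{Y}_V\}$, henceforth denoted by $Q_p$, which is distributed as

$$p(X_V, Y_V, \hat{Y}_V) = \left(\prod_{v \in V} p(X_v)\right) p(Y_V | X_V) \left(\prod_{v \in V} p(\hat{Y}_v | Y_v)\right), \tag{12}$$

the vector $(R, \{r_v\}_{v \in V_r})$ satisfies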

R < rin'\^) + Iin;Xn\Xnc) - Ii%.;Y^c\Xv), (13) 
V n,^, s.t, S eVt<ZV,D (^Vt^, where r{A) = T.v&a^v- 
Note 1: The choice $\hat{Y}_d = Y_d$ is always optimal for (13).

Note 2: In the usual cut-set definition, the node set is partitioned into two sets: a set $\Omega$
containing the source and the complementary set $\Omega^c$ containing the destination. However, here the
node set is partitioned into three parts: a set containing the source, $\Omega$, a set containing the
destination, and the rest.

Proof. The proof is by a random coding technique. A random ensemble of coding schemes is
defined using the collection of random variables $Q_p$ distributed as given by (12). A scheme in
the ensemble is generated as follows.

1. Source codebook and encoding: For each message $w \in [2^{RT}]$, the source generates a
$T$-length sequence $x_s^T(w)$ using i.i.d. $p(X_s)$.

2. Relay codebooks and mappings: For every relay node $v \in V_r$, a binned quantization
codebook is generated with $2^{T r_v}$ bins. The binned quantization codebook is given by
$\hat{y}_v^T(w_v, \tilde{w}_v)$, where $w_v \in [2^{T r_v}]$ and $\tilde{w}_v \in [2^{T \tilde{r}_v}]$. It is generated using i.i.d. $p(\hat{Y}_v)$.

Every relay node also generates a transmission codebook of size $2^{T r_v}$, which consists of
$x_v^T(w_v)$ sequences generated using i.i.d. $p(X_v)$.

On receiving $y_v^T$, the relay node finds a vector $\hat{y}_v^T(w_v, \tilde{w}_v)$ in the quantization codebook
that is jointly typical with $y_v^T$, and transmits $x_v^T(w_v)$ corresponding to the bin number
of the quantization vector.

If the relay cannot find any such quantization vector, it transmits the sequence corresponding
to a bin chosen uniformly at random. The probability of this latter event is made arbitrarily
small by letting

$$\tilde{r}_v = I(Y_v; \hat{Y}_v) - r_v + \epsilon_1, \tag{14}$$

for an arbitrarily small $\epsilon_1 > 0$. This ensures that the total size of the quantization
codebook is of the order $2^{T(I(Y_v; \hat{Y}_v) + \epsilon_1)}$.

3. Decoding: On receiving $y_d^T$, the destination node finds a unique $w$ and any $\{(w_v, \tilde{w}_v)\}_{v \in V_r}$
such that

$$\left(x_s^T(w), \left\{\hat{y}_v^T(w_v, \tilde{w}_v), x_v^T(w_v)\right\}_{v \in V_r}, y_d^T\right) \in \mathcal{T}_\epsilon. \tag{15}$$

If it is successful, the destination declares $w$ as the decoded message; if not, the
destination declares an error.

The theorem follows by the standard argument of showing that the average probability
of error, averaged over the ensemble of codes and over all messages, goes to zero as $T$ tends to
infinity. The details of the error probability analysis are in Appendix A.

□ 
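The codebook sizing in step 2 of the proof can be made concrete with a small calculation. The sketch below assumes (14) is read as: within-bin rate = $I(Y_v; \hat{Y}_v) - r_v + \epsilon_1$. The function name and argument names are hypothetical, and all quantities are base-2 exponents rather than actual codebooks.

```python
def codebook_exponents(T, mutual_info, r_v, eps1):
    """Base-2 exponents of the binned quantization codebook at one relay.

    Assumption (reading of (14)): the within-bin rate is mutual_info - r_v + eps1.
    Returns (log2 #bins, log2 #codewords per bin, log2 total codebook size).
    """
    within_bin_rate = mutual_info - r_v + eps1
    bins_exp = T * r_v
    per_bin_exp = T * within_bin_rate
    return bins_exp, per_bin_exp, bins_exp + per_bin_exp
```

For example, with `T=100`, `mutual_info=1.5`, `r_v=0.5`, and `eps1=0.25`, the total exponent is `T * (mutual_info + eps1)`, matching the order of the codebook size stated after (14). If `r_v` exceeds `mutual_info + eps1`, the within-bin rate becomes negative, which corresponds to the regime where each bin holds at most one quantization codeword.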

In the usual communication problem setup, one is interested in only maximizing the overall 
communication rate R. The following corollary of the above theorem establishes the achievable 
rate by the compress-and-forward scheme. 
Corollary 1. The communication rate $R$ is achievable by the compress-and-forward scheme if

$$R < \min_{\Omega \subseteq V,\, s \in \Omega} I(\hat{Y}_{\Omega^c}; X_\Omega | X_{\Omega^c}) - I(Y_\Omega; \hat{Y}_\Omega | X_V), \tag{16}$$

for some collection of random variables $Q_p$.

Proof. The compress-and-forward scheme with $r_v = I(Y_v; \hat{Y}_v) + \epsilon_1$ achieves this rate. □

It should be noted that the achievable rate in (16) is the same as the one obtained in the noisy
network coding scheme in [1]. This is not surprising since, by allowing the compression rates to be
large enough, the scheme essentially reduces to the noisy network coding scheme, where every
quantized codeword is uniquely mapped to a re-transmission codeword at the relay node.

2.2 A low-complexity layered decoding scheme

A maximum likelihood decoder maximizes the probability of the received vector conditioned
on the transmitted codeword at the source. (Note that the jointly-typical-set decoding is a
proof technique for the random coding argument and it upper-bounds the error probability
that can be achieved by the maximum likelihood (ML) decoder.)

ML decoder: $\hat{w} = \arg\max_{w} p\left(y_d^T \mid x_s^T(w)\right)$.

The conditional probability depends on the channel model and the operations (quantization,
compression and mapping) at each node. Therefore implementing an ML decoder has very
high complexity. In [10], the ML decoder is implemented for a simple one-relay network
with binary LDPC codes and a reduced quantizer operation, for which the decoding reduces
to belief propagation over a large Tanner graph, which comprises the Tanner graphs of the
LDPC codes for each node, the quantization and mapping operation, and the network itself.
Even when this simplified encoding scheme is extended to a network with multiple layers of
relay nodes, the decoding complexity would be large. In this section, a simplified decoding
architecture is presented for the compress-and-forward scheme which operates layer-by-layer
and decodes the compressed bits transmitted by each relay node.

Layered decoding scheme: The decoder at the destination node operates backwards,
layer-by-layer. First, it decodes the messages (or compressed bits) transmitted by the nodes in the
layer $O_{L-1}$. Then, using these decoded messages, it decodes the messages in the layer $O_{L-2}$.
This process continues until the destination node eventually decodes the source message. Note
that the layered decoding scheme is the same as the backward decoding for the block-encoding
schemes in relay networks.
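The order in which the destination processes the layers can be written down explicitly. This is only an illustrative sketch; the function name is hypothetical and layers are indexed $1, \ldots, L$ as in the text.

```python
def layered_decoding_order(L):
    """Layer indices in the order the destination decodes them.

    Starts with layer L-1 (the relays adjacent to the destination) and ends
    with layer 1 (the source message).
    """
    if L < 2:
        raise ValueError("a layered network needs at least a source and a destination layer")
    return list(range(L - 1, 0, -1))
```

At each step the messages already decoded from layer $l+1$ act as side information for decoding layer $l$, which is what the backward induction in the analysis of this scheme relies on.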

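The following theorem characterizes the feasible region of $(R, \{r_v\}_{v \in V_r})$.

Theorem 2. A rate vector $(R, \{r_v\}_{v \in V_r})$ is feasible for the compress-and-forward scheme,
under the layered decoding scheme, if for some $Q_p$ the vector $(R, \{r_v\}_{v \in V_r})$ satisfies

$$r(U) < I(X_U; Y_D | X_{O_{L-1} \setminus U}), \quad \forall U \subseteq O_{L-1}, \tag{18}$$

$$r(U) - r(O_{l+1} \setminus V) < I(X_U; \hat{Y}_V | X_{O_l \setminus U}) - I(Y_{O_{l+1} \setminus V}; \hat{Y}_{O_{l+1} \setminus V} | X_{O_l}), \quad \forall U \subseteq O_l,\ V \subseteq O_{l+1},\ 2 \le l \le L-2, \tag{19}$$

$$R - r(O_2 \setminus V) < I(X_s; \hat{Y}_V) - I(Y_{O_2 \setminus V}; \hat{Y}_{O_2 \setminus V} | X_s), \quad \forall V \subseteq O_2. \tag{20}$$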

Proof. The proof is by backward induction. Assuming that the destination has decoded the
messages transmitted by the relay nodes in layer $O_{l+1}$, the probability of error for decoding the
messages from the layer $O_l$ is considered. To do so, a hypothetical layered network as shown
in Figure 3 is considered. This network consists of the layers $O_l$ and $O_{l+1}$ and, in addition, a
layer with an aggregator node $A$. A node $v$ in layer $O_{l+1}$ is connected to the aggregator
node with a wired link of capacity $r_v$ bits per symbol. This layer represents the forward
part of the network beyond layer $O_{l+1}$.

Figure 3: A hypothetical network.

This network is now a multiple-source single-destination relay network, with all the nodes
in layer $O_l$ being source nodes and the aggregator node as the destination node. The node
$(l, j)$ has a message for the aggregator node with rate $r_{(l,j)}$. The noisy network coding scheme
assures that the messages can be decoded with arbitrarily small probability of error, if

$$r(U) - r(O_{l+1} \setminus V) < I(X_U; \hat{Y}_V | X_{O_l \setminus U}) - I(Y_{O_{l+1} \setminus V}; \hat{Y}_{O_{l+1} \setminus V} | X_{O_l}), \tag{21}$$

$\forall U \subseteq O_l$, $V \subseteq O_{l+1}$, where the above inequality corresponds to the cut $\Omega = U \cup (O_{l+1} \setminus V)$.

□ 

Note that the layered decoding scheme is weaker than the ML decoding scheme. Therefore 
the feasible region under the layered decoding scheme should be a strict subset of the feasible 
region under the ML decoding scheme. 

However, the following theorem shows that the compress-and-forward scheme with layered 
decoding achieves similar communication rate as the noisy network coding scheme. 
Theorem 3. The communication rate $R$ is achievable by the compress-and-forward scheme
with layered decoding if for some collection of random variables $Q_p$,

$$R < \min_{\Omega \subseteq V,\, s \in \Omega} I(\hat{Y}_{\Omega^c}; X_\Omega | X_{\Omega^c}) - \kappa_1, \tag{22}$$

where the constant $\kappa_1$ is given by the recursive relation

$$\kappa_l = I(Y_{O_{l+1}}; \hat{Y}_{O_{l+1}} | X_{O_l}) + \kappa_{l+1} |O_{l+1}|, \tag{23}$$

and = 0. 

Proof. The above theorem will be proved by characterizing an information flow for the network
in Section 3.2. □

Note that the conditions of Theorem 2 can be interpreted as a flow decomposition for the
layered network. If R is the information that flows from the source to the destination, then the 
flow decomposition gives the effective amount of information that flows through each node. 
If the compression rate at each relay node is made approximately equal to the information 
flowing through that node, then the layered decoding where the destination ends up decoding 
the effective information at each node has a chance to work. Thus, in order to choose the 
right compression rates at each node, a flow decomposition for the network must be obtained. 
These notions are made more precise in the next section. 
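The recursion (23) for the rate penalty can be evaluated backwards over the layers. The sketch below is an illustration under stated assumptions: the function name and arguments are hypothetical, and the terminal value of the recursion is left as an explicit `boundary` parameter that the caller must supply rather than being fixed by this code.

```python
def kappa_recursion(layer_info, next_layer_sizes, boundary=0.0):
    """Evaluate kappa_l = I_l + |O_{l+1}| * kappa_{l+1} backwards, as in (23).

    layer_info[i]       : value of I(Y_{O_{l+1}}; Yhat_{O_{l+1}} | X_{O_l}) for the
                          i-th listed layer l (in increasing order of l).
    next_layer_sizes[i] : |O_{l+1}| for the same layer.
    boundary            : assumed value of kappa just past the last listed layer.
    Returns the list of kappa values aligned with layer_info.
    """
    if len(layer_info) != len(next_layer_sizes):
        raise ValueError("one mutual-information value and one size per layer")
    kappas = [0.0] * len(layer_info)
    following = boundary
    for i in range(len(layer_info) - 1, -1, -1):
        kappas[i] = layer_info[i] + next_layer_sizes[i] * following
        following = kappas[i]
    return kappas
```

Because each step multiplies the downstream penalty by the size of the next layer, the penalty at the source can grow quickly with depth and width, even when each per-layer mutual-information term is small.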

Remark 1. Assuming the maximum likelihood (ML) decoding is done by an exhaustive search
as given by (15), the decoding complexity of the joint decoding is the product of the codebook sizes
of all the nodes. Therefore the complexity of the joint decoding is given by

veVr 

where $n_{q,v}$ is the number of quantization points in the relay quantization codebook. With the
compress-and-forward scheme with the layered decoding, the complexity is reduced to

L-l 

1=1 veOi+i 



3 Flows with Bisubmodular Capacity Constraints 

Maximum flow problems are extensively studied in graph theory and combinatorial optimiza- 
tion [11]. The problems are most often motivated from the study of transportation and 
communication networks. A directed graph $(V, \mathcal{E})$ consists of the set of vertices or nodes $V$
and the set of edges $\mathcal{E} \subseteq V \times V$. Traditionally, a flow is defined to be a non-negative function
over the set of all edges which satisfies the flow-conservation law at each vertex other than the
source and the destination node. Further, the flow over any edge is at most the capacity of
that edge. The classic max-flow min-cut result of [12] characterizes the maximum flow
from the source to the destination node and shows it to be equal to the min-cut of the graph. In
order to distinguish it from the concept of the node-flow that will be introduced here, such a flow
is called an edge-flow over an edge-capacitated graph. Beginning from the single-commodity
result of [12], various extensions of these problems have been considered. In particular,
the edge-capacitated graph was extended to a polymatroidal network [13], where the flow is
constrained not only by the edge capacities but by joint capacities on sets of incoming and
outgoing edges at every vertex. A special case is the node-capacitated graph [11], where the
constraints on the flow are on the sum-total of the incoming and outgoing flow at each node.

In this section, the concept of a node-flow in the context of a layered graph with bisubmod- 
ular constraints on the flows is introduced. The node-flows can be related to the edge-flows 
with flow-conservation at the node. Note that the conservation law for edge-flow enforces 
that the net incoming flow at any node is equal to the net outgoing flow at the node and this 
quantity can be viewed as the node-flow for a node. The bisubmodular constraints can be 
viewed as generalizations of the polymatroidal constraints of [12]. The definitions here are 
motivated by the layered coding scheme for the wireless network, which was presented in the 
previous chapter. The main result is a max-flow min-cut theorem for the single-commodity 
node-flow for a graph with bisubmodular capacity constraints. The result is closely related
to, and can be viewed as a generalization of, the flow introduced in the context of the linear
deterministic networks and polylinking systems in [8], [9].

3.1 A max-flow min-cut theorem 

In this section, the max-flow min-cut theorem is proved for a single-commodity node-flow on a
layered graph with bisubmodular capacity constraints.

Layered graph: A layered graph is considered, which is represented by a set of nodes $V$,
which can be decomposed into subsets $O_l$, $1 \le l \le L$, as shown in Figure 2. The layering
is ensured by the edges of the graph, which connect nodes in any layer $l$ to nodes in the
subsequent layer $l + 1$. Since the edges do not play any role in the problem here, beyond
ensuring the layering, they will henceforth be neglected. The first layer $O_1$ has a single node,
which is the source node, and the last layer $O_L$ has a single node, which is the destination
node.

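Bisubmodular capacity functions: The bisubmodular capacity functions are defined for the
layered graph using a family of $L - 1$ functions
$\{\rho_l : 1 \le l \le L - 1\}$, $\rho_l : 2^{O_l} \times 2^{O_{l+1}} \to \mathbb{R}^+$, which satisfy the following properties:

1. $\rho_l$ is bisubmodular, i.e., $\forall U_1, U_2 \subseteq O_l$, $V_1, V_2 \subseteq O_{l+1}$,

$$\rho_l(U_1 \cup U_2, V_1 \cap V_2) + \rho_l(U_1 \cap U_2, V_1 \cup V_2) \le \rho_l(U_1, V_1) + \rho_l(U_2, V_2). \tag{26}$$

2. $\rho_l$ is non-decreasing, i.e., for $U_1 \subseteq U_2 \subseteq O_l$ and $V_1 \subseteq V_2 \subseteq O_{l+1}$,

$$\rho_l(U_1, V_1) \le \rho_l(U_2, V_2). \tag{27}$$

3. If $U = \emptyset$ or $V = \emptyset$, then

$$\rho_l(U, V) = 0. \tag{28}$$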
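For small layers, the three defining properties of a bisubmodular capacity function can be checked by brute force. The sketch below is illustrative only: the function names are hypothetical, monotonicity is taken to mean non-decreasing in both set arguments (an assumed reading of property 2), and the running time is exponential in the layer sizes.

```python
from itertools import combinations

def subsets(nodes):
    """All subsets of a collection of nodes, as frozensets."""
    nodes = list(nodes)
    for k in range(len(nodes) + 1):
        for combo in combinations(nodes, k):
            yield frozenset(combo)

def is_bisubmodular_capacity(layer, next_layer, rho, tol=1e-9):
    """Brute-force check of properties 1-3 for rho(U, V), U in layer, V in next_layer."""
    Us, Vs = list(subsets(layer)), list(subsets(next_layer))
    for U in Us:
        for V in Vs:
            # Property 3: zero whenever either argument is empty.
            if (not U or not V) and abs(rho(U, V)) > tol:
                return False
    for U1 in Us:
        for U2 in Us:
            for V1 in Vs:
                for V2 in Vs:
                    # Property 1: bisubmodularity.
                    lhs = rho(U1 | U2, V1 & V2) + rho(U1 & U2, V1 | V2)
                    if lhs > rho(U1, V1) + rho(U2, V2) + tol:
                        return False
                    # Property 2 (assumed reading): monotone in both arguments.
                    if U1 <= U2 and V1 <= V2 and rho(U1, V1) > rho(U2, V2) + tol:
                        return False
    return True
```

As a sanity check, `rho(U, V) = min(len(U), len(V))`, the submatrix rank of a generic matrix, passes the check, whereas `rho(U, V) = (len(U) * len(V)) ** 2` fails bisubmodularity.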



Node-flow: The node-flow for the layered graph is defined as a function $f : V \to \mathbb{R}^+$ which
satisfies the capacity constraints, i.e.,

$$f(V) - f(O_l \setminus U) \le \rho_l(U, V), \quad \forall U \subseteq O_l,\ V \subseteq O_{l+1},\ \forall l \in [L - 1], \tag{29}$$

where $f(A)$ is an overloaded notation, such that when $A \subseteq V$, $f(A) \triangleq \sum_{v \in A} f(v)$.
Further, the destination node must sink the flow from the source. Therefore $f(D) = f(S)$.

The max-flow problem is to find the maximum $f(S)$ that can be supported given the
capacity constraints on the graph. An efficient algorithm to compute the flow at each node
given any $f(S)$ that can be supported is also sought.
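The capacity constraints (29) can be verified directly for a candidate node-flow on a small layered graph. The following sketch assumes the constraint reads $f(V) - f(O_l \setminus U) \le \rho_l(U, V)$; the data layout (a list of layers, a dict of node-flows, and a callable `rho(l, U, V)` with a 0-based layer index) is hypothetical.

```python
from itertools import combinations

def subsets(nodes):
    """All subsets of a collection of nodes, as frozensets."""
    nodes = list(nodes)
    for k in range(len(nodes) + 1):
        for combo in combinations(nodes, k):
            yield frozenset(combo)

def is_feasible_node_flow(layers, rho, f, tol=1e-9):
    """Check f(D) = f(S) and the constraints (29) between every pair of adjacent layers."""
    source, dest = layers[0][0], layers[-1][0]
    if abs(f[source] - f[dest]) > tol:
        return False
    for l in range(len(layers) - 1):
        layer, next_layer = layers[l], layers[l + 1]
        for U in subsets(layer):
            # f(O_l \ U): flow carried by the nodes of layer l outside U.
            outside = sum(f[v] for v in layer if v not in U)
            for V in subsets(next_layer):
                if sum(f[v] for v in V) - outside > rho(l, U, V) + tol:
                    return False
    return True
```

For a diamond network `[["s"], ["r1", "r2"], ["d"]]` with `rho(l, U, V) = min(len(U), len(V))`, the flow that sends one unit from the source and splits it equally across the two relays is feasible, while any flow with `f["s"] = 2` is not.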

An upper bound on the max-flow is given by the cut function.

Cut function: The cut function $C : 2^V \to \mathbb{R}^+$ is defined as

$$C(\Omega) = \sum_{l=1}^{L-1} \rho_l(\Omega_l, O_{l+1} \setminus \Omega_{l+1}), \tag{30}$$

where $\Omega_l = \Omega \cap O_l$.
Clearly,

$$\max f(S) \le \min C(\Omega). \tag{31}$$

The next theorem shows that the min-cut is achievable. The proof is constructive and
gives an efficient method of computing the flow.



Theorem 4.

$$\max f(S) = \min C(\Omega). \tag{32}$$

Proof. The proof is based on the polymatroid intersection theorem. The details are in Ap- 
pendix [HI □ 
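For intuition, the min-cut value on the right-hand side of the theorem can be computed by exhaustive search on a small layered graph. This is a brute-force illustration, not the constructive polymatroid-intersection method referred to in the proof. The function name and data layout are hypothetical, and cuts are restricted to sets that contain the source and exclude the destination.

```python
from itertools import combinations

def min_cut(layers, rho):
    """Minimize C(Omega) = sum_l rho(l, Omega_l, O_{l+1} \\ Omega_{l+1}) over all cuts.

    layers : list of lists of node names; first layer = [source], last = [destination].
    rho    : callable rho(l, U, V) with 0-based layer index l and frozenset arguments.
    Returns (min cut value, minimizing Omega).
    """
    source = layers[0][0]
    relays = [v for layer in layers[1:-1] for v in layer]
    best = None
    for k in range(len(relays) + 1):
        for extra in combinations(relays, k):
            omega = {source, *extra}
            value = 0.0
            for l in range(len(layers) - 1):
                U = frozenset(v for v in layers[l] if v in omega)
                V = frozenset(v for v in layers[l + 1] if v not in omega)
                value += rho(l, U, V)
            if best is None or value < best[0]:
                best = (value, frozenset(omega))
    return best
```

On the diamond network `[["s"], ["r1", "r2"], ["d"]]` with `rho(l, U, V) = min(len(U), len(V))`, the search returns a min-cut value of 1, attained for example by the cut that contains only the source.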

The max-flow min-cut theorem for node-flows with bisubmodular constraints presented
here is closely related to the max-flow min-cut results of [8], [9]. [8] considered linear
deterministic networks, which led to bisubmodular capacity functions arising from the rank of a
matrix. [9] considered polylinking systems, where the bisubmodular capacity functions are
given by the polylinking function. The results of [9] generalized the results of [8] by showing
that a linear deterministic network is a special case of a polylinking system.

The max-flow min-cut theorem can be easily generalized to the following two cases:

• Multi-source: Consider a layered graph with $J$ source nodes in $O_1$ and a single
destination node in $O_L$, such that $f(O_1) = f(D)$. For this case, the following corollary
generalizes Theorem 4.

Corollary 2. $\{f(v) \mid v \in O_1\}$ is a feasible flow iff

$$f(\Omega_1) \le C(\Omega), \quad \forall \Omega \subseteq V, \tag{33}$$

where $\Omega_1 \triangleq \Omega \cap O_1$.

• Multi-destination: Consider a layered graph with a single source node in $O_1$ and $J$
destination nodes in $O_L$, such that $f(S) = f(O_L)$. For this case, the following corollary
generalizes Theorem 4.

Corollary 3. $\{f(v) \mid v \in O_L\}$ is a feasible flow iff

$$f(\Omega_L) \le C(\Omega), \quad \forall \Omega \subseteq V, \tag{34}$$

where $\Omega_L \triangleq \Omega \cap O_L$.

Note that the proof for the multiple sources (or destinations) case follows by adding a
hypothetical supernode $A$ in layer $0$ (or $L + 1$) with capacity functions $\rho_0$ (or $\rho_L$) given by
$\rho_0(A, V) = \sum_{v \in V} f(v)$, $\forall V \subseteq O_1$ (or $\rho_L(V, A) = \sum_{v \in V} f(v)$, $\forall V \subseteq O_L$).

3.2 Proof of Theorem 3: A Compress-and-Forward Scheme from Flows

In this section, Theorem 3 is proved by establishing a connection between the compression
rates of the compress-and-forward scheme with the layered decoding and the node-flows with
bisubmodularity constraints. Recall that the achievable rates for the compress-and-forward
scheme with layered decoding are given by (18)-(20), which appear very much like the
bisubmodular capacity constraints.

To make this connection more precise, first observe the following proposition. 

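Proposition 1. Given the collection of random variables $Q_p$ distributed as given by (12), the
family of $L - 1$ functions $\rho_l : 2^{O_l} \times 2^{O_{l+1}} \to \mathbb{R}^+$, $\forall l \in [L - 1]$, defined by

$$\rho_l(U, V) = I(X_U; \hat{Y}_V | X_{O_l \setminus U}) \tag{35}$$

forms a family of bisubmodular capacity functions.

Proof. Appendix D. □

For any $\Omega \subseteq V$, the corresponding cut value $C(\Omega)$ is now given by

$$C(\Omega) = \sum_{l=1}^{L-1} I(X_{\Omega_l}; \hat{Y}_{O_{l+1} \setminus \Omega_{l+1}} | X_{O_l \setminus \Omega_l}) \tag{36}$$

$$= I(\hat{Y}_{\Omega^c}; X_\Omega | X_{\Omega^c}). \tag{37}$$

Theorem 4 is then used to construct a flow $f(v)$ for this network, such that

$$f(S) < \min_{\Omega :\, S \in \Omega,\, D \in \Omega^c} I(\hat{Y}_{\Omega^c}; X_\Omega | X_{\Omega^c}), \tag{38}$$

and

$$f(V) - f(O_l \setminus U) \le \rho_l(U, V), \quad \forall U \subseteq O_l,\ V \subseteq O_{l+1},\ \forall l \in [L - 1]. \tag{39}$$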
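For any $v \in O_l$, $l \in [L - 1]$, let

$$r_v = f(v) - \kappa_l, \tag{40}$$

and $R = f(S) - \kappa_1$, where $\kappa_l$ is given by (23).
Then $\forall\, \emptyset \neq U \subseteq O_l$, $V \subseteq O_{l+1}$,

$$r(U) - r(O_{l+1} \setminus V) = f(U) - f(O_{l+1} \setminus V) - |U| \kappa_l + |O_{l+1} \setminus V| \kappa_{l+1} \tag{41}$$

$$< \rho_l(U, V) - \kappa_l + |O_{l+1}| \kappa_{l+1} \tag{42}$$

$$= \rho_l(U, V) - I(Y_{O_{l+1}}; \hat{Y}_{O_{l+1}} | X_{O_l}) \tag{43}$$

$$< I(X_U; \hat{Y}_V | X_{O_l \setminus U}) - I(Y_{O_{l+1} \setminus V}; \hat{Y}_{O_{l+1} \setminus V} | X_{O_l}). \tag{44}$$

Therefore $(R, \{r_v\}_{v \in V_r})$ satisfies (18)-(20). This proves Theorem 3.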
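The assignment of rates from a node-flow used in this proof can be sketched as a direct computation. The sketch assumes the readings $r_v = f(v) - \kappa_l$ for a relay $v$ in layer $l$ and $R = f(S) - \kappa_1$. The function name and data layout are hypothetical, and `kappa[l]` uses a 0-based layer index, so `kappa[0]` plays the role of $\kappa_1$.

```python
def rates_from_flow(layers, f, kappa):
    """Map a node-flow to (R, {r_v}) by subtracting the per-layer penalty.

    layers : list of lists of node names, source layer first, destination layer last.
    f      : dict of node-flows f(v).
    kappa  : list with one penalty per non-destination layer (0-based).
    """
    source = layers[0][0]
    R = f[source] - kappa[0]
    relay_rates = {
        v: f[v] - kappa[l]
        for l in range(1, len(layers) - 1)
        for v in layers[l]
    }
    return R, relay_rates
```

A negative entry in the result indicates that the chosen flow value is too small to absorb the penalty at that layer, in which case the corresponding rate bound is vacuous.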

4 Generalizations to multi-source networks 
Figure 4: A layered multi-source network.

The communication network with multiple source nodes $\{S_i \mid i \in [J]\}$ is illustrated in Figure
4. The source node $S_i$ has an independent message $W_i$ at rate $R_i$. There is a common destination
node D. The multi-source relay network was perhaps first studied in fT^l [TB]. where the rate 
region for the deterministic case and an approximate rate region for the Gaussian case were 
established. The noisy network coding scheme of [1] extends to this case as well. In fact, this
result was used for each layer to analyze the layered decoding scheme in the proof of Theorem 2.

The results of the compress-and-forward scheme and the layered decoding scheme can
be generalized to the communication network with multiple source nodes and a common
destination node.

The following corollary extends the results of the compress-and-forward scheme for the 
unicast network to the multi-source relay network. 

Theorem 5. The communication rates $R = (R_1, \ldots, R_J)$ are achievable by the compress-and-forward scheme (with joint decoding) for the multi-source single destination network if, for some collection of random variables $Q_P$ which is distributed as (12), the rates satisfy

$R(\Omega_1) < I(\hat{Y}_{\Omega^c}; X_\Omega \mid X_{\Omega^c}) - I(Y_\Omega; \hat{Y}_\Omega \mid X_{\mathcal{V}}), \quad \forall \Omega \subseteq \mathcal{V} \text{ s.t. } D \in \Omega^c,$ (45)

where $\Omega_1 \triangleq \Omega \cap \mathcal{O}_1$.

Further, with the layered decoding scheme, the rates $R = (R_1, \ldots, R_J)$ are achievable if

$R(\Omega_1) < I(\hat{Y}_{\Omega^c}; X_\Omega \mid X_{\Omega^c}) - \kappa_1,$ (46)

where $\kappa_l$ is given by (23).

The results can be proved by adding a hypothetical supernode in layer 0, which is connected
to the source nodes with orthogonal wired links such that the wired link to node $S_i$ is of rate
$R_i$.



5 Special cases 
5.1 Wireless network 

For the special case of the wireless network described by (11), the achievable rates can be
compared to the cutset bound [7].

As noted in [1], a good choice of $\hat{Y}_v$ for the Gaussian network is given by

$\hat{Y}_v = Y_v + \hat{Z}_v,$ (47)

where $\hat{Z}_v \sim \mathcal{CN}(0, 1)$ is independent across nodes.

The particular choice of $\hat{Y}_v$ implies that the quantization is done at the noise level. This
also agrees with the philosophy in [21 H] , where the quantization was done at the noise level 
to show approximate optimality; in [2], scalar quantization was done at the noise level, and 
in [1], quantization was done using the discrete superposition network, which was a model 
obtained from the wireless network by clipping the signal at the noise level. 

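As shown in [1], with this choice of $\hat{Y}_v$ and with $X_v \sim \mathcal{CN}(0, 1)$,

$I(\hat{Y}_{\Omega^c}; X_\Omega \mid X_{\Omega^c}) = \log \left| I + \frac{H_{\Omega\Omega^c} H^*_{\Omega\Omega^c}}{2} \right|$ (48)

$\ge \log \left| I + H_{\Omega\Omega^c} H^*_{\Omega\Omega^c} \right| - |\mathcal{V}|.$ (49)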
And further,

$I(\hat{Y}_v; Y_v \mid X_{\mathcal{V}}) \le 1.$ (50)
Using (49), (50) and Lemma 6.6 in [2], the following corollary of Theorem 5 follows.

Corollary 4. If $R = (R_1, \ldots, R_J)$ is in the cutset bound, then rates $R - 3|\mathcal{V}|\mathbf{1}$ are achievable by
the compress-and-forward scheme (with joint decoding) for the multi-source single destination 
Gaussian network. Further, with the layered decoding scheme, the rates R— (2|V| + kI)1 are 
achievable, where 

Kf = l + Kl,\Oi+,l (51) 

and K^i^i = 0. 
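
The constants behind (48)-(50) can be sanity-checked in the scalar case. The sketch below assumes a single transmit and a single receive antenna with unit-variance channel noise and unit-variance quantization noise, so that the channel matrix reduces to one gain and `snr` plays the role of $|h|^2$; the function names are hypothetical and the check is an illustration of the one-bit gap, not the matrix argument of [1].

```python
import math

def quantized_rate(snr):
    """Scalar form of (48): log2(1 + snr / 2), total noise variance 2."""
    return math.log2(1 + snr / 2)

def cutset_rate(snr):
    """Scalar cut value without quantization noise: log2(1 + snr)."""
    return math.log2(1 + snr)

def quantization_penalty():
    """I(Yhat; Y | X) for Yhat = Y + Zhat with unit-variance Z and Zhat."""
    noise_var, quant_var = 1.0, 1.0
    return math.log2(1 + noise_var / quant_var)

gaps = []
for snr in (0.0, 0.5, 1.0, 10.0, 1e3, 1e6):
    gap = cutset_rate(snr) - quantized_rate(snr)
    gaps.append(gap)
    print(f"snr={snr:g}  gap={gap:.6f} bits")

print("penalty per node:", quantization_penalty())
```

In this scalar setting the loss from quantizing at the noise level never exceeds one bit, and the per-node penalty term equals one bit, which is the kind of per-node constant that accumulates into the gaps stated in the corollary.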

5.2 Deterministic network 

For the special case of the deterministic network described by (Q, the optimal choice of Y^j is 
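$\hat{Y}_v = Y_v$, and with this choice

$I(\hat{Y}_{\Omega^c}; X_\Omega \mid X_{\Omega^c}) = H(Y_{\Omega^c} \mid X_{\Omega^c}).$ (52)

And further,

$I(\hat{Y}_v; Y_v \mid X_{\mathcal{V}}) = 0.$ (53)
Therefore, specializing the results of Theorem 5 leads to the following corollary.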

Corollary 5. For the multi-source single-destination deterministic network, $R = (R_1, \ldots, R_J)$
is achievable by the compress-and-forward scheme with the layered decoding scheme if, for some
collection of random variables $Q_P$ which is distributed as (12),

$R \in C(Q_P),$ (54)
where C{Qp) is the cutset bound evaluated under the product distribution for the network f^. 

Specializing further to the linear deterministic network, it can be shown that the product
distribution (with each $X_v$ uniformly distributed over its input alphabet) maximizes the cutset
bound, thereby showing that all rates in the cutset bound are achievable.
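
For a linear deterministic network the cut term in (52) becomes a matrix rank. The sketch below assumes a one-hop model $Y = GX$ over GF(2) with independent, uniformly distributed input bits, for which the conditional entropy of the outputs outside the cut given the inputs outside the cut equals the rank of the corresponding sub-matrix of $G$. The matrix `G_xor`, the node names and the helper functions are hypothetical illustrations.

```python
def gf2_rank(rows):
    """Rank over GF(2) of a matrix given as a list of 0/1 row lists."""
    rows = [list(r) for r in rows]
    n_cols = len(rows[0]) if rows else 0
    rank = 0
    for col in range(n_cols):
        # Find a pivot row at or below the current rank with a 1 in this column.
        pivot = next((i for i in range(rank, len(rows)) if rows[i][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        # Eliminate this column from every other row (XOR is addition in GF(2)).
        for i in range(len(rows)):
            if i != rank and rows[i][col]:
                rows[i] = [a ^ b for a, b in zip(rows[i], rows[rank])]
        rank += 1
    return rank

def cut_entropy(G, tx_nodes, rx_nodes, omega):
    """Rank of the sub-matrix from transmitters inside omega to receivers outside omega."""
    rows = [r for r, v in enumerate(rx_nodes) if v not in omega]
    cols = [c for c, u in enumerate(tx_nodes) if u in omega]
    return gf2_rank([[G[r][c] for c in cols] for r in rows])

# Assumed example: both receivers observe the XOR of the two transmitted bits.
G_xor = [[1, 1],
         [1, 1]]
tx, rx = ["a", "b"], ["c", "d"]
print(cut_entropy(G_xor, tx, rx, {"a", "b"}))
print(cut_entropy(G_xor, tx, rx, {"a"}))
```

Gaussian elimination is used here only because it is short; any exact GF(2) rank routine would serve the same purpose.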

6 Conclusion 

In this paper, the compress-and-forward scheme is analyzed for the unicast relay network. It 
is shown that while it achieves the same overall rate as NNC, it allows for a lower complexity 
layered/backward decoding algorithm. However, this requires each relay node to compress
its information by the right amount. This paper also presents a computationally efficient
way of finding the optimal compression rates at each relay node using a node-flow formulation
over a bisubmodular constrained graph.
A Probability of Error Analysis for CF Scheme

Without loss of generality we assume that the message with index 1 is transmitted at the 
source and the index corresponding to the quantized vectors at each node is (1, 1). We will 
find the probability of error that this message is wrongly decoded at the destination. We 
denote by £w,{w,w)^ the event that 

[x^siw), {yl,k){w{i,k), ^c.fc))' ^J,fc)(^a.fc))}(i,fc)gv. ' ^ ^^^^ 

Here $(w, \hat{w})_{\mathcal{V}_r}$ is shorthand for $\{(w_v, \hat{w}_v) \mid v \in \mathcal{V}_r\}$. The error event is the union of two terms
and is given by 

IJ £i,{w,w)v,. I U I U ^w,{w,w)v^ I • (56) 

The first term corresponds to the event that the transmitted message is not jointly typical, and
the second term corresponds to a message other than the transmitted one being jointly
typical. The first event can be upper bounded by E'^f^-^ . For any f2 C V^, and $ C Vr\f2, 
let 

©J^,* '= {{w, {w,w)vr)\w ^ 1, W(;,fc) ^ lW{l,k) e Q, 

w^i^k-) = 1, w^i^k) = 1V(/, A;) G $} , (57) 

and 

^n,<s> = [J ^w,{w,w)v^- (58) 

&n,<s> 

The second event can be equivalently written as, 

[J £w,{w,w)v, = [j£n,'i>, (59) 

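The probability of error can be upper bounded, using the union bound, by

$\mathbb{P}(\text{error}) \le \mathbb{P}\big(\mathcal{E}^c_{1,(1,1)_{\mathcal{V}_r}}\big) + \sum_{\Omega,\Phi} \mathbb{P}(\mathcal{E}_{\Omega,\Phi}).$ (60)

From the properties of joint typicality, it can be shown that the first term goes to 0 as
$T \to \infty$. It can be shown that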

p [Sn $) = 2^'^'^+''(^^+^(*"^)2^('^^'*^'^''^*''^*"''^"''^""''^'=^"'^^^ 

^ 2nR+rin)+fi^-))2T{H{Yi,%,%c\Xi,,Xnc,Xs)-H{Ya,%\Xnc)-T,ii,t^^^^ 
^ 2nR+riQ)+fi'l>''))2-T{H{Ya,Y^\Xnc)-H{YaS-i\Xn,Xnc,Xs)+Y:(^i,k)^^^ 
^ 2nR+rin)+f{'S--))2-T{liYa,%;Xn,Xs\Xnc)+j:^^l,^^^^cI{Y^^ 
Here $r(A) \triangleq \sum_{v \in A} r_v$. Using the Markovian property of the random variables, we have that
and using (fT4l) we have 

Therefore $\mathbb{P}(\mathcal{E}_{\Omega,\Phi}) \to 0$ if

R < r{Q'\<^) + I{Ya,%;Xn,Xs\Xnc) - I{%c;Y^c\Xv^,X,). (63) 



B Proof of Theorem 4



The theorem will be proved in a slightly more general setting, allowing multiple nodes in layer $\mathcal{O}_1$
and layer $\mathcal{O}_L$. Assuming that the flow values for these layers $\mathcal{O}_1$ and $\mathcal{O}_L$ are given and satisfy

$f(\mathcal{O}_1) = f(\mathcal{O}_L),$ (64)

$f(\Omega_1) - f(\Omega_L) \le C(\Omega), \quad \forall \Omega \subseteq \mathcal{V},$ (65)

the flow for all intermediate layers will be constructed.
The proof is by inductive construction.

For $L = 2$, there are no intermediate layers and the theorem holds by definition. Consider
$L > 2$. The induction hypothesis assumes that the flow can be constructed for networks with fewer than
$L$ layers whenever the flow for the boundary layers is specified with the constraints given by (65).

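Consider any $L_0 \in \{2, \ldots, L-1\}$. Define networks $\mathcal{N}_A$ and $\mathcal{N}_B$ to be the sub-networks of
$\mathcal{N}$ with the set of vertices $\mathcal{V}_A = \bigcup_{l=1}^{L_0} \mathcal{O}_l$ and $\mathcal{V}_B = \bigcup_{l=L_0}^{L} \mathcal{O}_l$ respectively. Similarly, denote the
cut for the two networks by $C_A$ and $C_B$ respectively.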

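Next, a flow for the layer $\mathcal{O}_{L_0}$ will be constructed which satisfies the following conditions.

$f(\mathcal{O}_{L_0}) = f(\mathcal{O}_1),$ (66)
$f(\Omega_A \cap \mathcal{O}_1) - f(\Omega_A \cap \mathcal{O}_{L_0}) \le C_A(\Omega_A), \quad \forall \Omega_A \subseteq \mathcal{V}_A,$ and (67)

$f(\Omega_B \cap \mathcal{O}_{L_0}) - f(\Omega_B \cap \mathcal{O}_L) \le C_B(\Omega_B), \quad \forall \Omega_B \subseteq \mathcal{V}_B.$ (68)

The induction hypothesis would then guarantee that the flows for the intermediate layers in
the sub-networks $\mathcal{N}_A$ and $\mathcal{N}_B$ can be constructed.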

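Using (66), the set of linear inequalities given by (67) can be written as

$f(\Omega'_A \cap \mathcal{O}_{L_0}) - f(\Omega'_A \cap \mathcal{O}_1) \le C_A(\Omega_A), \quad \forall \Omega_A \subseteq \mathcal{V}_A,$ (69)

where $\Omega'_A = \mathcal{V}_A \setminus \Omega_A$. For any fixed $T \subseteq \mathcal{O}_{L_0}$, the collection of inequalities where $\Omega'_A \cap \mathcal{O}_{L_0} = T$
can be concisely represented as

$f(T) \le \min \{ C_A(\Omega_A) + f(\Omega'_A \cap \mathcal{O}_1) : \Omega'_A \cap \mathcal{O}_{L_0} = T \}.$ (70)

Defining

$r_A(T) = \min \{ C_A(\Omega_A) + f(\Omega'_A \cap \mathcal{O}_1) : \Omega'_A \cap \mathcal{O}_{L_0} = T \},$ (71)

the set of linear inequalities given by (67) can be concisely written as

$f(T) \le r_A(T), \quad \forall T \subseteq \mathcal{O}_{L_0}.$ (72)

Similarly, defining

$r_B(T) = \min \{ C_B(\Omega_B) + f(\Omega_B \cap \mathcal{O}_L) : \Omega_B \cap \mathcal{O}_{L_0} = T \},$ (73)
the set of linear inequalities given by (68) can be concisely written as

$f(T) \le r_B(T), \quad \forall T \subseteq \mathcal{O}_{L_0}.$ (74)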
The following properties for the functions $r_A(T)$ and $r_B(T)$ can be established.
Lemma 1. The functions $r_A(T)$ and $r_B(T)$ are

• submodular,

• non-decreasing, and

• satisfy $r_A(\emptyset) = 0$ and $r_B(\emptyset) = 0$.

Proof. Appendix C. □
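Define the following polymatroids with the functions $r_A$ and $r_B$.

$P_A = \{ x \in \mathbb{R}_+^{m_{L_0}} : x(U) \le r_A(U),\ \forall U \subseteq \mathcal{O}_{L_0} \}$ (75)
$P_B = \{ x \in \mathbb{R}_+^{m_{L_0}} : x(U) \le r_B(U),\ \forall U \subseteq \mathcal{O}_{L_0} \},$ (76)

where $x = [x(1) \ldots x(m_{L_0})]$ and $x(U) \triangleq \sum_{u \in U} x(u)$. The conditions (66)-(68) are now equivalent to finding

$[f(L_0,1) \ldots f(L_0,m_{L_0})] \in P_A \cap P_B,$ (77)

such that $f(\mathcal{O}_{L_0}) = f(\mathcal{O}_1)$. It then follows from Edmonds' polymatroid intersection theorem ([11],
Corollary 46.1c) that

$\max \{ x(\mathcal{O}_{L_0}) : x \in P_A \cap P_B \} = \min_{T \subseteq \mathcal{O}_{L_0}} \{ r_A(\mathcal{O}_{L_0} \setminus T) + r_B(T) \}.$ (78)

Therefore the required flow exists since

$f(\mathcal{O}_1) \le \min_{T \subseteq \mathcal{O}_{L_0}} \{ r_A(\mathcal{O}_{L_0} \setminus T) + r_B(T) \}$ (79)

$= \min_{\Omega \subseteq \mathcal{V}} \{ C(\Omega) + f(\mathcal{O}_1 \setminus \Omega_1) + f(\Omega_L) \}.$ (80)

Further, in Theorem 47.1 of [11] it is shown that the maximizing $x$ in (78) can be computed
in polynomial time in the dimension of $x$. Hence, the flow can also be computed in polynomial
time in the number of nodes.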
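
The min-max relation (78) can be checked by brute force on a tiny instance. The sketch below uses two hypothetical rank functions, `r_a` and `r_b`, each the minimum of a non-negative additive function and a constant (hence submodular, non-decreasing and zero on the empty set); they are illustrative choices and not the functions $r_A$, $r_B$ of the proof. The search is restricted to integer points. Since every feasible $x$ satisfies $x(E) \le r_A(E \setminus T) + r_B(T)$ for each $T$, an integer point attaining the minimum certifies equality for this instance.

```python
from itertools import combinations, product

E = (0, 1, 2)           # ground set, playing the role of the layer O_{L_0}
B_WEIGHTS = (1, 1, 5)   # hypothetical per-element weights for r_b

def subsets(items):
    """Yield every subset of `items` as a frozenset."""
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def r_a(T):
    """Hypothetical submodular function: min(2|T|, 5)."""
    return min(2 * len(T), 5)

def r_b(T):
    """Hypothetical submodular function: min(sum of weights in T, 6)."""
    return min(sum(B_WEIGHTS[u] for u in T), 6)

def in_polymatroid(x, r):
    """Check x(U) <= r(U) for every subset U of the ground set."""
    return all(sum(x[u] for u in U) <= r(U) for U in subsets(E))

# Left side of (78): best total over integer points in both polymatroids.
primal = max(
    sum(x)
    for x in product(range(7), repeat=len(E))
    if in_polymatroid(x, r_a) and in_polymatroid(x, r_b)
)

# Right side of (78): minimum over all splits of the ground set.
dual = min(r_a(frozenset(E) - T) + r_b(T) for T in subsets(E))

print(primal, dual)
```

A polynomial-time algorithm would replace both enumerations in practice; the exhaustive version is only meant to make the statement concrete.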
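C Proof of Lemma 1

We will prove the lemma for $r_B(T)$. The proof for $r_A(T)$ is similar. Here $d(\cdot)$ denotes the given flow values on the boundary layer $\mathcal{O}_L$, that is, the term written $f(\Omega_B \cap \mathcal{O}_L)$ in (73).

1. Submodularity:
Let

$r_B(T^{(1)}) = C_B(\Omega_B^{(1)}) + d(\Omega_B^{(1)} \cap \mathcal{O}_L), \quad \Omega_B^{(1)} \cap \mathcal{O}_{L_0} = T^{(1)},$ (81)
$r_B(T^{(2)}) = C_B(\Omega_B^{(2)}) + d(\Omega_B^{(2)} \cap \mathcal{O}_L), \quad \Omega_B^{(2)} \cap \mathcal{O}_{L_0} = T^{(2)}.$ (82)

Since

$(\Omega_B^{(1)} \cup \Omega_B^{(2)}) \cap \mathcal{O}_{L_0} = T^{(1)} \cup T^{(2)},$ (83)
$(\Omega_B^{(1)} \cap \Omega_B^{(2)}) \cap \mathcal{O}_{L_0} = T^{(1)} \cap T^{(2)},$ (84)

it follows that

$r_B(T^{(1)} \cup T^{(2)}) \le C_B(\Omega_B^{(1)} \cup \Omega_B^{(2)}) + d((\Omega_B^{(1)} \cup \Omega_B^{(2)}) \cap \mathcal{O}_L),$ (85)
$r_B(T^{(1)} \cap T^{(2)}) \le C_B(\Omega_B^{(1)} \cap \Omega_B^{(2)}) + d((\Omega_B^{(1)} \cap \Omega_B^{(2)}) \cap \mathcal{O}_L).$ (86)

By the definition of cut and the bi-submodularity of $\rho_l$, it is easy to verify that $C_B(\Omega_B)$
is submodular. And since $d$ is an additive function, it then follows that $r_B(T)$ is
submodular.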
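
The last step of the submodularity argument can be written out explicitly. The following is a sketch assuming that $\Omega_B^{(1)}$ and $\Omega_B^{(2)}$ attain the minima defining $r_B(T^{(1)})$ and $r_B(T^{(2)})$; it adds the two bounds (85) and (86), then applies the submodularity of $C_B$ and the additivity of $d$.

```latex
\begin{align*}
r_B(T^{(1)} \cup T^{(2)}) + r_B(T^{(1)} \cap T^{(2)})
 &\le C_B(\Omega_B^{(1)} \cup \Omega_B^{(2)}) + C_B(\Omega_B^{(1)} \cap \Omega_B^{(2)}) \\
 &\quad + d\big((\Omega_B^{(1)} \cup \Omega_B^{(2)}) \cap \mathcal{O}_L\big)
        + d\big((\Omega_B^{(1)} \cap \Omega_B^{(2)}) \cap \mathcal{O}_L\big) \\
 &\le C_B(\Omega_B^{(1)}) + C_B(\Omega_B^{(2)})
        + d(\Omega_B^{(1)} \cap \mathcal{O}_L) + d(\Omega_B^{(2)} \cap \mathcal{O}_L) \\
 &= r_B(T^{(1)}) + r_B(T^{(2)}).
\end{align*}
```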

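2. Non-decreasing:
Consider $T^{(1)} \subseteq T^{(2)}$. Let $\Omega_B^{(2)}$ attain the minimum for $T^{(2)}$, so that

$r_B(T^{(2)}) = C_B(\Omega_B^{(2)}) + d(\Omega_B^{(2)} \cap \mathcal{O}_L), \quad \Omega_B^{(2)} \cap \mathcal{O}_{L_0} = T^{(2)}.$ (87)

Let $\Omega_B = \Omega_B^{(2)} \setminus (T^{(2)} \setminus T^{(1)}) \subseteq \Omega_B^{(2)}$, so that $\Omega_B \cap \mathcal{O}_{L_0} = T^{(1)}$. By the definition of
cut and the non-decreasing property of $\rho_l$, it follows that $C_B(\Omega_B) \le C_B(\Omega_B^{(2)})$. Also, since $\Omega_B$ and $\Omega_B^{(2)}$ differ only in layer $\mathcal{O}_{L_0}$,
$d(\Omega_B \cap \mathcal{O}_L) = d(\Omega_B^{(2)} \cap \mathcal{O}_L)$. Therefore

$r_B(T^{(2)}) = C_B(\Omega_B^{(2)}) + d(\Omega_B^{(2)} \cap \mathcal{O}_L)$ (88)
$\ge C_B(\Omega_B) + d(\Omega_B \cap \mathcal{O}_L)$ (89)
$\ge r_B(T^{(1)}).$ (90)

3. $r_B(\emptyset) = 0$:

When $T = \emptyset$, by letting $\Omega_B = \emptyset$, it follows that $r_B(\emptyset) = 0$.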
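D Proof of Proposition 1

We need to show that $I(X_U; \hat{Y}_W \mid X_{\mathcal{O}_l \setminus U})$ satisfies the three properties of channel functions.
Firstly we show that it is bi-submodular.

$I(X_U; \hat{Y}_W \mid X_{\mathcal{O}_l \setminus U}) = H(\hat{Y}_W \mid X_{\mathcal{O}_l \setminus U}) - H(\hat{Y}_W \mid X_{\mathcal{O}_l})$ (91)
$= H(\hat{Y}_W, X_{\mathcal{O}_l \setminus U}) - H(X_{\mathcal{O}_l \setminus U}) - H(\hat{Y}_W \mid X_{\mathcal{O}_l}).$ (92)

The submodularity of entropy [18] implies that $H(\hat{Y}_W, X_{\mathcal{O}_l \setminus U})$ is bi-submodular.

The submodularity of entropy follows from the fact that, given collections of random variables
$T_1$ and $T_2$, we have

$H(T_1) + H(T_2) - H(T_1 \cup T_2) - H(T_1 \cap T_2) = I(T_1 \setminus T_2; T_2 \setminus T_1 \mid T_1 \cap T_2)$ (93)

$\ge 0.$ (94)

The product form of the random variables implies that $H(X_{\mathcal{O}_l \setminus U})$ and $H(\hat{Y}_W \mid X_{\mathcal{O}_l})$ are modular
or additive. Therefore, $I(X_U; \hat{Y}_W \mid X_{\mathcal{O}_l \setminus U})$ is bi-submodular.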
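
The identity (93)-(94) can be verified numerically on a small example. The sketch below uses a hypothetical joint distribution over three binary variables and takes $T_1$ to be variables 0 and 1 and $T_2$ to be variables 1 and 2, so that the right side of (93) is the conditional mutual information between variables 0 and 2 given variable 1, computed here directly from its definition rather than from entropies.

```python
import math

# Hypothetical joint pmf over three binary variables (indices 0, 1, 2).
raw = {(a, b, c): 1 + a + 2 * b * c + (a ^ c)
       for a in (0, 1) for b in (0, 1) for c in (0, 1)}
total = sum(raw.values())
pmf = {outcome: w / total for outcome, w in raw.items()}

def marginal(pmf, idx):
    """Marginal pmf of the variables listed in the index tuple `idx`."""
    out = {}
    for outcome, p in pmf.items():
        key = tuple(outcome[i] for i in idx)
        out[key] = out.get(key, 0.0) + p
    return out

def entropy(pmf, idx):
    """Joint entropy in bits of the variables in `idx`."""
    return -sum(p * math.log2(p) for p in marginal(pmf, idx).values() if p > 0)

def cond_mutual_info(pmf, a, c, b):
    """I(X_a; X_c | X_b) in bits, computed directly from its definition."""
    p_acb = marginal(pmf, a + c + b)
    p_ab, p_cb, p_b = marginal(pmf, a + b), marginal(pmf, c + b), marginal(pmf, b)
    na, nc = len(a), len(c)
    info = 0.0
    for key, p in p_acb.items():
        if p == 0:
            continue
        ka, kc, kb = key[:na], key[na:na + nc], key[na + nc:]
        info += p * math.log2(p * p_b[kb] / (p_ab[ka + kb] * p_cb[kc + kb]))
    return info

t1, t2 = (0, 1), (1, 2)
lhs = entropy(pmf, t1) + entropy(pmf, t2) - entropy(pmf, (0, 1, 2)) - entropy(pmf, (1,))
rhs = cond_mutual_info(pmf, (0,), (2,), (1,))
print(lhs, rhs)
```

Both quantities agree up to floating-point error and are non-negative, which is the content of (93) and (94) for this particular distribution.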

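Next, we show the non-decreasing property. Given $U_1 \subseteq U \subseteq \mathcal{O}_l$ and $W_1 \subseteq W \subseteq \mathcal{O}_{l+1}$, we
have

$I(X_U; \hat{Y}_W \mid X_{\mathcal{O}_l \setminus U}) = H(X_U \mid X_{\mathcal{O}_l \setminus U}) - H(X_U \mid X_{\mathcal{O}_l \setminus U}, \hat{Y}_W)$ (95)

$\ge H(X_U \mid X_{\mathcal{O}_l \setminus U}) - H(X_U \mid X_{\mathcal{O}_l \setminus U}, \hat{Y}_{W_1})$ (96)
$= I(X_U; \hat{Y}_{W_1} \mid X_{\mathcal{O}_l \setminus U})$ (97)
$= H(\hat{Y}_{W_1} \mid X_{\mathcal{O}_l \setminus U}) - H(\hat{Y}_{W_1} \mid X_{\mathcal{O}_l})$ (98)

$\ge H(\hat{Y}_{W_1} \mid X_{\mathcal{O}_l \setminus U_1}) - H(\hat{Y}_{W_1} \mid X_{\mathcal{O}_l})$ (99)
$= I(X_{U_1}; \hat{Y}_{W_1} \mid X_{\mathcal{O}_l \setminus U_1}),$ (100)

where both the inequalities follow from the fact that conditioning reduces entropy.
The third property is readily seen.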